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NUCLEOTIDE SEQUENCES ENCODING THE TOMATO LIGHT 
HYPERSENSITIVE PHENOTYPE, ENCODED PROTEINS AND USES THEREOF 

Description 

The present invention relates to nucleotide sequences 
5 encoding the tomato light hypersensitive phenotype, encoded 
proteins and uses thereof. 

In particular the present invention relates to 
nucleotide sequences encoding a protein, whose qualitative 
or quantitative modification and/or inhibition in plants 

10 induces high levels of carotenoids and/or flavonoids and/or 
chlorophylls, in comparison with wild- type plants; the 
invention also relates to the use of these nucleotide 
sequences for the production of engineered plants to be 
employed in the agro-industrial sector. 

15 Light is a critical environmental signal that 

controls many aspects of plant growth and development. It 
is perceived by a sophisticated series of photoreceptors: 
the phytochromes, which absorb red and far red light, the 
cryptochromes, which absorb blue and UV-A light 

20 wavelengths, and the UV-B receptors (Mustilli and Bowler, 
1997) . Together with endogenous hormonal signals, these 
photoreceptors regulate the developmental changes known as 
photomorphogenesis • Photomorphogenesis is defined as the 
influence of light on plant development and comprises leaf 

25 and chloroplast development and the regulation of 
photosynthetic apparatus components, by means of the 
coordinated expression of both nuclear and cytoplasmic 
genes. Moreover, due to the light response, photoprotectant 
pigments such as flavonoids are also produced. The 

30 modifications occurring during photomorphogenesis have been 
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characterized by studying light effects on Arabidopsis 
seedling development (von Arnim and Deng, 1996) . Light- 
grown Arabidopsis seedlings display short hypocotyls, open 
and expanded cotyledons and the expression of light- 
regulated genes which are responsible for flavonoid and 
chlorophyll biosynthesis (e.g. chalcone synthase, CHS; 
chlorophyll A/B binding protein, CAB) . Dark-grown seedlings 
display elongated hypocotyls, closed cotyledons and 
repression of light-regulated genes. 

In higher plants, the phytochromes are encoded by a 
gene family (e*g. PHYA-E in Arabidopsis r Sharrock and 
Quail, 1989; Clack et al • , 1994) and although they are the 
best characterized photoreceptors, relatively little is 
known about how the light signals perceived by phytochromes 
are transduced to the nucleus to activate the various 
developmental, physiological and molecular responses to 
light. Recently, biochemical studies using microinjection 
into cells of the phytochrome deficient aurea (au) tomato 
mutant, along with pharmacological studies in 
photomixotrophic soybean cell cultures, have implicated 
heterotrimeric G-proteins, cGMP, calcium and calmodulin as 
intermediates in phytochrome signal transduction pathways 
(Bowler and Chua, 1994; Mustilli and Bowler, 1997) . In 
parallel, several genetic screens have been developed to 
identify mutants potentially affected in light signal 
transduction (Chamowitz and Deng, 1996) . Most of the 
photomorphogenic mutants have been characterised in 
Arabidopsis and can be classified as either insensitive or 
constitutive mutants. Insensitive mutants display a light- 
blind elongated phenotype in the light. Some are mutated in 
the photoreceptors themselves, whilst others are presumed 
to encode positive regulators of light signal transduction 
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pathways (Chamovitz and Deng, 1996; Chory et al., 1996; 
Whitelam and Harberd, 1994) . Conversely, constitutive de- 
etiolated mutants (e.g. cop/det/fus/cpd) display light 
grown morphologies when grown in the dark together, in some 
5 cases, with the inappropriate expression of light regulated 
genes such as CAB and CHS (Millar et al . , 1994; Szekeres et 
al., 1996). The recessive nature of these mutations 
suggests that they are loss-of -function and that the wild- 
type genes are repressors of photomorphogenesis in 

10 darkness. However, although epistasis tests with 
phytochrome-def icient mutants have indicated that they 
function downstream of phytochrome, they are not 
specifically mutated in phytochrome signal transduction 
because many have altered tissue specificities as well as 

15 other additional phenotypes not directly related to light 
(Mayer et al., 1996; Chory and Peto, 1990; Millar et al., 
1994; Szekeres et al., 1996). It is therefore not clear how 
COP/DET/FUS/CPD proteins function in the signal 
transduction pathways defined biochemically (Bowler and 

20 Chua, 1994) . 

A more targeted approach to identify specific 
components of signal transduction pathways specific for 
phytochrome could be the isolation of mutants with altered 
dynamics of light responses, rather than mutants with 

25 constitutive phenotypes in the absence of light. Several 
such light hypersensitve mutants have already been isolated 
in tomato (denoted hp-I, hp-2, atv, Tp; Kendrick et al . , 
1994) . In particular, the recessive non-allelic iip-I and 
hp-2 mutants have been characterized by their exaggerated 

30 light responsiveness, displaying higher anthocyanin levels 
(a flavonoid subgroup), shorter hypocotyls and more deeply 
pigmented fruits than wild-type plants. These mutants were 
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first identified in 1917 (Reynard, 1956) and in 1975 
(Soressi, 1975), respectively* Recently, hp-l"^ (Peters et 
alw 1989) and (Van Tuinen et al., 1997) mutants have 

been isolated and identified as new hp-1 and hp-2 alleles, 
5 respectively. Because these phenotypes appear to be 
identical to those obtained by ectopic expression of 
phytochrome A {PHYA) in tomato (Boylan and Quail, 1989), it 
would appear that the hp mutation may affect fairly 
specifically phytochrome responses. The recessive nature of 

10 the mutations, coupled with results from epistasis tests of 
hp-1 with the phytochrome deficient tomato mutants aurea 
(au) , phyA {fri) , and phyB (tri), have suggested that HP 
genes encode negative regulators of light signal 
transduction mechanisms, acting downstream of both PhyA and 

15 PhyB (Kerckhoffs et al., 1997). The fact that no 
counterparts of hp mutants have been isolated so far in 
Arahidopsls, along with the observation that in tomato 
anthocyanin production and the expression of photoregulated 
genes (e.g., CHS and CAB) is strictly light-dependent, 

2 0 indicates the importance of hp mutants for studying 
phytochrome-dependent signal transduction . Furthermore, 
microinjection-based studies using the au tomato mutant 
have shown that tomato is an excellent model system to map 
the role of individual components in the phytochrome 

25 activated signalling cascade (Bowler and Chua, 1994) . 

Therefore the identification and characterization of hp 
genes is likely to be very important for studying the 
regulation of photomorphogenesis in plants. 

The authors of the present invention have cloned the 

30 tomato gene and have studied at the molecular level 

the role of the HP-2 protein during the modulation of 
photomorphogenesis and fruit development. The authors have 
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found that the tomato HP-2 gene exhibits high sequence 
homology with the Arabidopsls DETl gene, which belongs to 
the above described constitutive COP/DET/FUS mutant group* 
Therefore the tomato HP-2 gene has been renamed TDETl . 
5 The authors have used Solanum lycopersicum (tomato) 

species plants, but those skilled in the art will recognize 
that the cloning could be repeated with no inventive 
efforts with other plant species, as but not limited to 
pepper, eggplant, soybean, grape, melon, rice, carrot, 

10 spinach, citrus, pomaceae and ornamental species. The 
authors have cloned and sequenced the gene responsible for 
the tomato hp mutation {high pigment) , which causes a light 
hypersensitive phenotype, thus enhancing carotenoid, and/or 
chlorophyll and/or flavonoid pigment levels. The gene is 

15 the first to be identified that causes such a mutant 
phenotype . 

hp mutants potentially have a direct application in 
the agro-industrial sector, for generating tomato fruits 
with high carotenoid and/or flavonoid contents. In 

20 particular, in tomato fruits of hp mutants, a high content 
of the carotenoid lycopene as well as other carotenoids and 
flavonoids has been observed (Thompson, 1955; Yen et al • , 
1997) . However, up to now the use of hp mutants in the 
agro-industrial sector, even if bred into various 

25 commercial varieties, has been impaired because of the fact 
that the hp mutation generates other undesirable 
phenotypes, such as reduced internode length and reduced 
plant vigour. 

The cloning of the TDETl gene and its use by means of 
30 gene transfer technologies offers considerable advantages 
with respect to conventional breeding techniques. It is 
possible to transfer genes suitable for agriculture between 
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different species and to inactivate and/or engineer genes 
of the same species, even only in specific plant tissues. 
Furthermore gene transfer is a much more rapid technique 
than conventional breeding. 

The cloning of the TDETl gene now allows the 
production of engineered plants which exhibit all the 
favourable features of the hp mutation, with none of the 
undesirable side-effects. Such plants could belong to the 
Solanum genus, but those skilled in the art will recognize 
that, with no inventive effort, it is possible to transfer 
the gene, or parts thereof, subcloned in suitable 
expression vectors, into plants belonging to different 
genera, as e.g. but not limited to pepper, eggplant, 
soybean, grape, melon, rice, carrot, spinach, citrus, 
pomaceae and ornamental species. 

It has been well recognized that a diet rich in 
lycopene and other carotenoids, or their administration in 
the form of pills, produce favourable effects on human 
health. As a matter of fact, carotenoids have antioxidant 
properties, B-carotene is a pro-vitamin A and lycopene is 
known to be an effective antitumoral agent (Hartley and 
Scolnik, 1995; Giles and Ireland, 1997; Hoffmann and 
Weisburger, 1997; Pappalardo et al . , 1997; Pool-Zobel et 
al., 1997; Rock et al . , 1997; Sharoni et al., 1997; Stahl 
and Sies, 1996) . Therefore the engineered plants of the 
present invention can be advantageously utilized as a 
source of such compounds, either as fresh or processed 
foods or as nutraceuticals . 

Furthermore hp-2 mutants have high levels of 
flavonoids such as anthocyanins (Von Wettstein-Knowles, 
1968), which are also considered to be excellent 
antioxidants and which exhibit in some cases antitumoral 
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properties (Fotsis et al . , 1997/ Rice-Evans et al.^^ 1997). 
Furthermore some flavonoids exhibit a role in plant 
protection against pathogenic agents and UV light 
irradiation (Shirley, 1996) . 
5 Therefore the manipulation of TDETl activity can be 

used to modify carotenoid and/or flavonoid content in 
several plant species (e.g., tomato, pepper, eggplant, 
soybean, grape, melon, rice, carrot, spinach, citrus, and 
pomaceae) for biotechnological uses in both the biomedical 

10 and agro-industrial sectors. 

Furthermore the manipulation of TDETl gene expression 
can be used to modify the anthocyanin and carotenoid 
content in ornamental species for the achievement of new 
colour variants (Griesbach, 1984) . 

15 In addition, because it has been recognized that a 

high content of carotenoids improves resistance to 
Norf lurazon-type herbicides, a further application of the 
TDETl gene is the production of transgenic plants by 
selection using herbicides rather than antibiotic compounds 

20 (Misawa et al., 1993). 

Furthermore in the same plant it is possible to 
combine a modified TDETl activity with mutations such as 
rin, nor and Nr^ which interrupt the fruit ripening 
process, or with biosynthetic genes of the carotenoid 

25 biosynthesis pathway (Bartley and Scolnik, 1995) , to obtain 
varieties exhibiting new qualitative characteristics for 
agro-industry . 

The attainment of an hp mutant phenotype by means of 
a biotechnological approach can be carried out 

30 advantageously by means of the inhibition of TDETl 
activity- Currently the best method for reducing gene 
expression is through the introduction, by gene transfer. 
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of the antisense sequence of the same gene or part thereof 
under the control of appropriate regulatory sequences. The 
use of such techniques in plants has been carried out in 
several existing examples (Oeller et al.^ 1991; Penarrubia 
5 et al., 1992), but those skilled in the art will recognize 
that alternative techniques can be used, without departing 
from the scope of the present invention. 

Furthermore, by using specific vectors such as, for 
example, but not limited to, pE8mGFP4, which contains 

10 regulatory sequences of the tomato E8 gene (Fig. 9) 
(Deikman and Fischer, 1988), the inactivation of the TDETl 
gene can be specifically modulated in the tomato fruit. 
Those skilled in the art will recognize that other 
regulatory sequences, e.g., originating from the 

15 polygalacturonase gene (PG) (Nicholass et al . , 1995), can 
be used in the place of the E8 gene promoter to obtain 
specific tissue modulation of TDETl gene expression. 

Within the scope of the present invention the term 
""light hypersensitive phenotype" means reduced plant growth 

20 associated with high levels of carotenoids and/or 
chlorophylls and/or flavonoids. 

The term ""protein or functional parts thereof, 
responsible for the light hypersensitive mutant phenotype" 
means an amino acid sequence which, if structurally or 

25 otherwise altered, induces a light hypersensitive phenotype 
in a plant. 

Therefore one object of the present invention is a 
nucleic acid comprising a nucleotide sequence encoding a 
protein, or functional parts thereof, which, if modified, 
30 is responsible for the light hypersensitive mutant 
phenotype in Solanum lycopersicum plants, said phenotype 
including reduced plant growth associated with high levels 
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of carotenoids and/or chlorophylls and/or flavonoids* 
Nucleic acids encoding proteins which are homologous to 
proteins of the Arabidopsis COP/DET/FUS family, which when 
modified result in a light hypersensitive phenotype, are 
within the scope of the present invention. 

Preferably the nucleic acid comprises the nucleotide 
sequence encoding the TDETl protein, or functional parts 
thereof. More preferably the nucleic acid comprises a 
nucleotide sequence encoding the protein having the amino 
acid sequence of SEQ ID No. 2 or functional portions 
thereof. More preferably the nucleic acid has a nucleotide 
sequence comprised in SEQ ID No. 1, more preferably from 
nt. 149 to nt. 1720. Alternatively the nucleic acid has a 
nucleotide sequence complementary to SEQ ID No. 1, or parts 
thereof. 

In one aspect of the invention the nucleic acid of 
SEQ ID No. 1 comprises a mutation which is able to induce 
the light hypersensitive phenotype; preferably at least a 
C-^T substitution in position 1640; alternatively the 
nucleic acid of SEQ ID No. 1 is deleted at least from nt. 
1581 to nt. 1589. 

A further object of the present invention is an 
expression vector including, under the control of an active 
and inducible plant promoter, the nucleic acid of the 
invention. Preferably the promoter is active only in some 
plant organs, more preferably in fruits. Preferred vectors 
are able to drive the transcription of an antisense RNA, 
for example pBIN-E8-'HP2-ASl and pBIN-E8-HP2-AS2 . 

A further object of the invention is the use of the 
vectors of the invention for producing transgenic plants 
which comprise the nucleic acid, under the control of 
specific regulating sequences, preferably in preselected 
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plant organs. Plants can belong to pepper, eggplant, 
soybean, grape, melon, rice, carrot, spinach/ citrus, 
pomaceae and ornamental species . 

A further object of the invention is a transgenic 
5 plant, or parts thereof, achievable by transformation with 
the vectors of the invention. Plants can belong to pepper, 
eggplant, soybean, grape, melon, rice, carrot, spinach, 
citrus, pomaceae and ornamental species. 

Plant genetic transformation techniques are known to 

10 those skilled in the art and comprise, but are not limited 
to transformation by Agrobacterium, electroporation, 
microinjection, or bombardment with DNA coated 
particles (Christou, 1996) . 

In a further aspect the invention includes a protein, 

15 or functional parts thereof, whose modification is 
responsible for the light hypersenstive phenotype in 
Solanum lycopersicum plants. Proteins homologous to the 
Arahidopsis COP/DET/FUS family are within the scope of the 
present invention, provided that, when modified, they 

20 result in a light hypersensitive phenotype. Preferably the 
protein comprises the amino acid sequence of SEQ ID No. 2 
or parts thereof. 

In one aspect of the invention the protein comprises 
a modification which is able to induce the light 

25 hypersensitive phenotype; preferably at least a 
modification in its C-terminal portion; more preferably a 
replacement of proline at position 498, most preferably 
serine as a substitute for proline; alternatively a 
deletion of at least one amino acid, preferably at least of 

30 three amino acids, more preferably from aa. 478 to aa. 480 
of SEQ ID No. 2. 
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The present invention will be described by reference 
to explanatory, but not limiting, examples, wherein 
reference will be made to the following figures: 

Figure 1. Phenotypes of wild type, iip-2, and 
5 tomato seedlings grown in the presence or in the absence of 
light. a) Hypocotyl length of seedlings grown in the 
absence (Dark) and in the presence of light (Light) • b) 
Content of anthocyanins in the hypocotyls (dashed bars) and 
in the cotyledons (empty bars) of seedlings grown in the 
10 light. Data represent average values from ten seedlings. 
Following a three day pre-germination period in darkness, 
seedlings were grown either in darkness (Dark) or in a 16 
hour light, 8 hour dark photoperiod (Light) for 6 days. 

Figure 2. TDETl {HP''2) cDNA nucleotide sequence (SEQ 
15 ID NO. 1) . The sequence encoding the TDETl protein is 
depicted in capital letters. The sequence of the RFLP CT151 
marker covers nt. 830 to nt . 2000. 

Figure 3. Amino acid sequence of the TDETl (HP-2) 
protein (SEQ ID No. 2) . Amino acids are depicted in the 
20 standard one letter code for amino acids. 

Figure 4. Exon-intron structure of the tomato TDETl 
(HP-2) gene and sites of mutations in hp~2 and hp-2^ mutant 
alleles. Stippled boxes indicate exons . Exons correspond to 
the following sequences of SEQ ID No. 1 (Figure 2): Exon 1: 
25 nt. 149 to nt . 220; Exon 2: nt . 221 to nt. 648; Exon 3: nt . 

649 to nt. 877; Exon 4: nt . 877 to nt. 1012; Exon 5: nt . 
1013 to nt. 1081; Exon 6: nt . 1082 to nt . 1138; Exon 7: nt . 
1139 to nt. 1219; Exon 8: nt . 1220 to nt . 1384; Exon 9: nt . 
1385 to nt. 1480; Exon 10: nt . 1481 to nt. 1580; Exon 11: 
30 nt. 1581 to nt. 1720; b) Donor and acceptor splice sites of 
intron 10 from the wild type and the iip-2 mutant. Brackets 
indicate splice sites, dots indicate internal intron 
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sequence (not shown) , and amino acids are shown in the one 
letter code. The asterisk in the hp-2 sequence denotes the 
site of the mutation. 

Figure 5. Alignirient of the amino acid sequences of 
5 5. lycopersicum TDETl, Arabidopsls DETl, and deduced 
mammalian EST amino acid sequences with homology to DETl 
(determined using Clustal method; Higgins and Sharp, 1988) . 
The putative bipartite NLS is overlined. The amino acids 
missing in the hp-2 mutant are indicated with asterisks, 

10 and the amino acid substitution in hp-2J is denoted by a 
plus sign. The mammalian sequences are a compilation of 
derived amino acid sequences from mouse and human ESTs 
(GenBank accession numbers AA756238, AA236057, AA050184, 
and W64359) . Boxed residues indicate conserved amino acids, 

15 dashes indicate arbitrary insertions. 

Figure 6. Effect of cytokinins on wild type tomato 
seedlings. (a) and (b) Hypocotyl length (cm) and anthocyanin 
content (per mg fresh weight [FW] ) , respectively, of wild 
type tomato seedlings grown in the presence of 0, 1, 5, 20, 

20 100, and 500 |ag/L benzoaminopurine . Seedlings were grown at 
25 ""C for 5 days in absolute darkness (Dark) or in a 16~hr 
light 8-hr dark photoperiod (Light) . In (a) values are the 
mean of 10 seedlings. Highly similar results were obtained 
with the cytokinin zeatin (data not shown) . 

25 Figure 7. Phenotype of the au hp''2 double mutant 

phenotype. Hypocotyl length (cm) and anthocyanin 
accumulation (per seedling) in wild type, au, hp'-2r and the 
au hp-2 double mutant are shown. Seedlings were grown at 2 5 
°C for 6 days in a 16-hr light 8-hr dark photoperiod. 

30 Values are the mean of 15 seedlings, and the experiments 
were repeated three times. 
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Figure 8. Gene expression in wild type, and hp- 

2^ seedlings. Seedlings were grown at 25 °C for 5 days in 
absolute darkness (D) , followed by 2 days in continuous 
white light (L) . The RNAs were extracted from whole 
5 seedlings (dark) or from cotyledons (C) and hypocotyls (H) 
(light) • Modifications in gene expression in hp-2 and hp-2^ 
seedlings compared with wild type seedlings (wt) are shown 
to be principally light-dependent. Ten microgram samples of 
total RNA were loaded on gels and analyzed for expression 
10 of CHSl {CHS), CAB6 (CAB), PRl-lbl (PR) genes following RNA 
gel blotting. 303 rRNA is shown as a control for loading. 

Figure 9. Binary plasmids to be transferred to 
plants - 

Materials and Methods 

15 Plant material and growth conditions 

The hp-2 and hp-23 exaggerated photoresponse mutants, 
the au hp'-2 double mutant, and the corresponding wild type 
tomato seeds {Solarium lycoperslcum cv Money Maker) were 
kindly provided by R. E. Kendrick and M. Koornneef 

20 (Wageningen Agricultural University, the Netherlands) . 

Seeds were surface sterilized and directly sown in magenta 
boxes (Sigma) containing 4.3 g/L Murashige-Skoog salts 
(Sigma) and 0.8 % agar. After 2 days pregermination in 
darkness, seedlings were grown at 25'' C either in a 16 hour 

25 light, 8 hour dark photoperiod or in continuous dark, as 
appropriate. For cytokinin treatment, seeds were sown in 
the presence of different concentrations of 
benzylaminopurine (Sigma) . 
Anthocyanin assay 

30 Anthocyanins were extracted from cotyledons, 

hypocotyls, and whole seedlings with 0.5 mL acidified (1% 
HCl) methanol for 48 hr in darkness with shaking. The 
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extracts were separated by the addition of 0.4 itiL of H2O 
and 1 luL of chlorof oriri;. followed by centrif ugation for 5 
min at 3, 000 rpm. The absorbance of the upper phase was 
determined spectrophotoinetrically at 535 nm (A535) , and the 
5 anthocyanin content was calculated as {A535)/mg fresh 
weight or (A535 } /seedling . 
Carotenoid and Chlorophyll assay 

In order to determine the content of carotenoids and 
chlorophyll in the tomato fruit,. pericarp sections of 

10 unripe (25 days after pollination) or ripe (50 days after 
pollination) fruit were weighed and incubated for 48 hours 
at 65°C in DMSO (dimethyl sulfoxide) in the absence of 
light. The contents of carotenoids and chlorophylls were 
determined by HPLC and values are reported as ]ig/g fresh 

15 weight. 

DNA and RNA Extraction 

DNA and RNA extractions from leaf samples were 
carried out according to Dellaporta et al . (1983) and 
Verwoerd et al . (1989), respectively. 

20 RNA gels (10 ]ig per lane) were blotted onto Hybond N+ 

membranes (Amersham) and hybridized with random primed 
probes (see below) . Hybridization was performed for 24 hr 
at SO'^C in phosphate buffer (7% SDS, 0 . 5M Na2HP04, pH 7.2, 
ImM EDTA) , followed by 20-min washes in 40mM NaP04 pH 7.2, 

25 1% SDS, and ImM EDTA. Probes were the tomato chalcone 
synthase gene CHSl (O'Neill et al., 1990), chlorophyll a/b 
binding protein gene CAB6 (Piechulla et al., 1991) and 
pathogenesis-related protein gene PR-lbl (Tornero et al . , 
1997) . All RNA gel blots were repeated at least twice and 

30 using different samples. 

Isolation and sequence of genomic clones, cDNAs and PGR 
products 
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5' amplification of cDNA was performed using the RACE 
system (Gibco/BRL) on total tomato leaf RNA with the 
oligonucleotide 5' -CATCAACACTGCCAAAC-3' (SEQ ID No. 3), 
derived from the CT151 RFLP marker sequence. A 0.8-lcb 
5 Polymerase Chain Reaction (PCR) product, containing the 5' 
end of the TDETl cDNA, was obtained after two different 
amplification reactions with Taql polymerase (Perkin Elmer) 
using two CT151-derived nested primers: 5'- 
GAAAGCAGCCGTTGCT-3' (SEQ ID No. 4) and 5'- 

10 AGTTCATCATCTTCACGGC-3' (SEQ ID No. 5), with the provided 
anchor primer (Gibco/BRL) The 0.8kb PCR fragment was 
directly sequenced on both strands with thermosequenase 
(Amersham) . 

Total cDNA from wild type tomato (cv. Money Maker) 

15 and the corresponding hp-2 and mutants were obtained 

by reverse transcription using avian myeloblastosis virus 
reverse transcriptase (Promega) of poly (A) mRNA isolated 
from leaves using oligo(dT) Dynabeads (Dynal) . From these 
samples the TDETl {HP-2) cDNA sequence was PCR amplified 

20 with the specific oligonucleotides: 

5' -GTATGATTCACTAGTTTAATGCTGCTGAAAG-3' (SEQ ID No. 6) and 
5' "-CCCATACTAGTCGTCTTGGCACTCTATCAAG-3^ (SEQ ID No. 7), using 
the Expand High Fidelity system (Boehringer Mannheim) . 
Subsequently the amplification product was subcloned in 

25 pBluescript as a Spel fragment and 4 independent clones 
were sequenced on both strands. 

Tomato genomic libraries in X-DASH and X-FIXII 
(kindly provided, respectively, by Jim Giovannoni, Texas 
A&M University, College Station, TX, and the Tomato Genome 

30 Center, Rehovot, Israel) were screened using standard 
methods (Sambrook et al . , 1989) with a 32p-iabelled CT151 
fragment as a probe. Overlapping fragments from different 
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recombinant phages were subcloned in pBluescript SK+ for 
sequencing of both strands. 

The 5' -GAAGGTAATTTTATATTAAA.CATAGAATAGA~3' (SEQ ID No. 
8) and 5' --GTGATTTCTAGGTTGATTTCAATCTAGA-3' (SEQ ID. No. 9) 
5 oligonucleotides were used to amplify the HP-2 gene 3" 
sequence from genomic DNA of the hp-2 mutant. The 1,300 bp 
amplification product was directly sequenced with the 
primer 5' -CAAATCGGTAACATAT-3' (SEQ ID No. 10) using a 
Thermosequenase kit (Amersham) . 

10 Binary plasmids for plant transformation 

DNA fragments containing the tomato fruit-specific E8 
promoter (Deikman and Fischer, 1988) were PGR amplified 
from S. lycopersicum total DNA with the following 
nucleotides: 5' -GGGGAAGCTTTTTCACGAAATCGGCCCTTA-3' (SEQ ID 

15 No. 11) and 5' -CCCGGATCCTTCTTTTGCACTGTGAATGATTAG-3' (SEQ ID 
No. 12). The 1.2kb amplification products were subcloned as 
Hindlll-BamHI fragments in the binary expression vector 
pmGFP4, derived from pBI121 (Bevan, 1984; Jeff erson et al . , 
1987; Haseloff et al . , 1997) in place of the 35S promoter, 

20 obtaining the plasmid pE8mGFP4. The mGFP4 gene was then 
excised and the inverted or complementary sequence of the 
HP-2 gene, or parts thereof, was inserted, as BamHI-SacI 
fragments, obtaining the plasmid pBIN-E8"HP2-ASl and pBIN- 
E8-HP2-AS2, respectively (Figure 9) . 

25 Results 

hp-2 mutant phenotypes 

hp-2 mutant seedlings exhibited an exaggerated light 
response with respect to wild type seedlings. Particularly, 
hypocotyl length was reduced (Figure la) and the content of 

30 anthocyanin pigments in the cotyledons was higher than in 
the wild type (Figure lb) . Such phenotypes were strictly 
dependent on the presence of light; in fact in the absence 
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of light the length of hypocotyl was not considerably 
different to the wild type (Figure la) and anthocyanins 
were not produced (data not shown) . The mutant 
exhibited a stronger phenotype than the mutant, with 

5 reference both to hypocotyl length and the content of 
anthocyanins (Figure 1) . 

Fruits obtained from plants of hp-2 and hp-2'^ mutants 
also exhibited exaggerated light responses. In the unripe 
fruits from hp-2 and mutants the content of 

10 chlorophyll was about five times higher than in fruits from 
wild-type plants, whereas in mature fruits the total 
content of carotenoids was about twice that of fruits from 
wild-type plants (Table 1) . 

Table 1 

15 Chlorophyll and carotenoid content of tomato fruits from 

wild type, and hp -2^ mutants . 

Genotype Total Chlorophylls Total Carotenoids 

(ripe fruit) (unripe fruit) 

20 Ug/g fresh weight pg/g fresh weight 

wild type 20 60 

(Money Maker) 

hp-2 90 124 

i2p-2^ 115 137 

25 The Phenotype of the Tomato hp-2 Mutant is Caused by 
Mutation in DETl 

The hp-2 mutation has been mapped previously using 
restriction fragment length polymorphism (RFLP) analyses of 
a segregating population derived from a cross between the 

30 hp-2 mutant (5. lycopersicum) and S. pennellii . Mapping 
data with a second backcross (BC-2) population indicated a 
position close to the centromere of chromosome 1, within a 
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cluster of several RFLP markers (Balint-Kurti et al . , 1995; 
Van Tuinen et al . , 1997; Broun and Tansksley, 1996). The 
authors of the present invention have found that the CT151 
RFLP marker (Tanksley et al • , 1992), included in this 
5 cluster, had high homology with the Arabldopsis DETl gene, 
mutations in which were previously identified as causing 
the deetiolated phenotype of detl mutants. 

Comparison of CT151 with Arahidopsis DETl revealed 
that CT151 has homology to the 3' end of DETl. The authors 

10 of the present invention isolated a full-length cDNA 
encoding TDETl using 5' rapid amplification of cDNA ends 
(Loh et al.^ 1989). Subsequently, the TDETl gene was 
isolated from genomic libraries in X-DASH and X-FIXII . 
Alignment of the TDETl cDNA and genomic sequences revealed 

15 the presence of 10 introns, which is similar to the number 
found in Arabldopsis (Figure 4a) . Nine introns are located 
in the same positions as those of the Arabldopsis DETl 
gene, whereas intron 2 of TDETl is not present in the 
Arabldopsis homolog (Figure 4a) (Pepper et al . ^ 1994). 

20 Comparative protein sequence analysis between Arabldopsis 
DETl and TDETl show 81.3% similarity and 74.8% identity 
(Figure 5) . No major differences are present between the 
sequences, except for a small deletion of 16 amino acids at 
the centre of TDETl , suggesting that the tomato and 

25 Arabldopsis genes are true homologs. A DETl homolog is not 
present in the yeast genome or in any prokaryotic genome 
sequenced to date (data not shown) . However, mouse and 
human expressed sequence tags (ESTs) with homology to DETl 
have been identified (Figure 5) , as well as a Drosophila 

30 homolog. 

Considering the presence of a nuclear localisation 
signal (NLS) (Robbins et al . , 1991) in the Arabldopsis DETl 
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amino acid sequence, and observing that the DETl-GUS 
chimeric protein is localised in the nucleus (Pepper et 
al,, 1994), DETl in Arabldopsis may be responsible for the 
transcriptional repression of photomorphogenesis in the 
5 absence of light. Considering the homology with HP-2 it is 
possible for the latter to have an analogous function in 
tomato. The TDETl protein has been found to be localized in 
the nucleus (data not shown) . 

TD^^ri -encoding cDNAs from the wild type, and hp~2 and 

10 hp-2j mutants were amplified with specific primers from 
leaf mRNA. Amplified fragments were subcloned in 
pBluescript and 4 independent clones were sequenced on both 
strands. In hp-2 the authors identified a substitution 
involving T in the place of C in exon 11 (nt. 1640) that 

15 causes a replacement of proline by serine in the protein C- 
terminal region (aa. 498) (Figure 4a) . In the hp-2 mutant 
the authors identified an alternative splicing site for 
intron 10, which causes a three amino acid deletion (Gly, 
Pro, Glu) at the start of exon 11, in the second NLS domain 

20 of the protein (aa. 478-480) (Figure 4a) . To find the 
mutation site which was responsible for the alternative 
splicing, the authors sequenced the 3' end of the TDETl gene 
from the hp-2 mutant, using a fragment obtained by PGR 
amplification of genomic DNA. A replacement of AG by TG in 

25 the consensus 3' splice junction of intron 10 was 
identified (Figure 4b) . These results demonstrate that the 
"'high pigment'' phenotype of the hp-2 and hp-2^ tomato 
mutants is caused by mutation of the TDETl gene mutation. 

Sequencing of the TDETl cDNAs derived from the hp-2 

30 mutant showed that the mutation produces two different 
splicing products from intron 10 (Figure 4b), suggesting 
the presence of a limited amount of wild-type TDETl protein 
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and that the hp-2 mutant is not an inactive allele. This is 
consistent with the observation that the hp-2 mutant allele 
is weaker than the allele (Figure 1 and Table 1) • 

Whether the hp-2^ mutation is null awaits further analysis, 
5 although the mutated proline residue in the hp-2^ mutant is 
conserved in both the plant sequences. However ;r because the 
hp-2 mutation is situated in the C-terminal domain of 
TDETl, as is the mutation in the Arabidopsis detl-5 mutant 
allele, which has a clearly de-etiolated phenotype (Pepper 
10 et al., 1994), it is apparent that the hp-2 mutant 
phenotypes are not comparable with the detl mutant 
phenotype in Arabidopsis . Furthermore, clearly weak alleles 
of det-I in Arabidopsis ^ eg* detl-1, display visible dark 
phenotypes . 

15 Comparison of Tomato hp-2 and Arabidopsis detl Mutants 

As is clear from the results presented, hp-2 mutants 
are phenotypically different from detl mutants* Most 
conspicuously they do not display dark phenotypes, such as 
reduced hypocotyl length, opened apical hooks, or enlarged 

20 cotyledons, whereas these were selection criteria for the 
isolation of det and cop in Arabidopsis (Chory et al . , 
1989; Deng et al . , 1991) . Conversely, detl mutants can even 
develop true leaves and floral buds in prolonged darkness. 
No such phenotypes can be observed in hp tomato seedlings 

25 grown in the dark. 

Arabidopsis detl mutants are also characterized by 
high level expression of light-regulated genes such as CHS 
and CAB in the dark, whereas in the light, CHS and CAB gene 
expression are similar to that observed in wild-type plants 

30 (Chory et al . , 1989). To examine the effects of hp-2 and 
hp-2i mutations on gene expression, the authors have 
performed RNA gel blot analysis of light-regulated gene 
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expression using CHS and CAB gene fragments as probes. In 
agreement with the weak phenotypes of hp-2 and hp-2j mutant 
seedlings grown in darkness no dramatic alteration of CHS 
and CAB gene expression was observed. Nonetheless, CHS and 
5 CAB mRNA levels were slightly higher in the allele 
when compared with hp-2 and wild-type seedlings (Figure 8) . 
Interestingly, CHS mRNA appeared to be of a slightly 
smaller size than that found in light-grown material, 
perhaps indicating a differential light-dependent splicing. 

10 Consistent with the exaggerated photoresponsiveness of hp-2 
and hp-2j mutants, CHS mRNA levels were significantly 
enhanced compared with wild-type seedlings following light 
irradiation for 48 hr (Figure 8) . hp-20 seedlings contained 
higher expression levels of CHS than hp-2, and CHS mRNA was 

15 particularly abundant in hypocotyls. In contrast to CHS, 
CAB mRNA levels were higher in cotyledons than in 
hypocotyls, and were slightly lower in light-exposed iip-2 
and hp-2j mutant seedlings when compared to the wild type 
(Figure 8) . In suirmary therefore, in tomato hp~2 mutants 

20 dark expression of CHS and CAB genes is only very slight 
and deregulation of gene expression is essentially light- 
dependent. This is in strong contrast to the CHS and CAB 
gene deregulation characteristics of Arahidopsis detl 
mutants (Pepper et al . , 1994). 

25 detl seedlings have been also reported to display a 

strongly enhanced expression of stress-related genes such 
as those encoding pathogenesis-related (PR) proteins and 
glutathione reductase (Mayer et al., 1996) To examine 
whether this was also the case in hp-2 and hp-20 mutants, 

30 RNA gel blots were hybridized with a probe encoding tomato 
PR-lbl (Tornero et al . ^ 1997). Although hypocotyls of hp-2 
seedlings reproducibly displayed PR-lbl gene expression at 
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slightly higher levels compared to wild-type seedlings, 
this effect was very weak compared to" that observed in 
Arabldopsls detl mutants and has never been observed in hp- 
2^ mutants (Figure 8) . 
5 Despite these above-mentioned differences, plastid 

development in darkness in the cotyledons of hp-2 and hp~2^ 
seedlings was found by the authors to be similar to that 
observed in Arabidopsis detl mutants (data not shown; Chory 
et al* , 1989) . 

10 Cytokinin can Phenocopy the hp Mutant 

Previous observations in Arabidopsis have shown that 
a detl phenotype can be phenocopied by the exogenous 
application of cytokinin (Chory et al . , 1994). 

To determine the effects of cytokinin treatment in 

15 tomato, the authors treated wild-type seedlings with 
different concentrations of cytokinin in the dark and in 
the light (Figure 6) . Cytokinin was found not to phenocopy 
a detl mutation in tomato. In dark-grown seedlings, 
although hypocotyls are shorter in the presence of 

20 cytokinin, apical hook opening, cotyledon expansion, and 
anthocyanin biosynthesis were not observed (Figure 6) , even 
after prolonged periods (up to three weeks) (data not 
shown) • However, in the light, cytokinin can phenocopy the 
hp mutation: seedlings displayed shorter and thicker 

25 hypocotyls and accumulated high levels of anthocyanin 
(Figure 6) . 

These results therefore indicate that at least as 
far as the effects of cytokinin are concerned, the hp 
mutation in tomato is equivalent to the detl mutation in 
30 Arabidopsis. Such a supposition is consistent with the fact 
that no constitutive de-etiolated mutants have heretofore 
been identified in tomato and CHS gene expression and 



wo 99/29866 



23 



PCT/IT98/00350 



consequent biosynthetic build-up of anthocyanin pigments is 
strictly dependent on the light, whereas in Arabidopsis 
anthocyanins can be produced also in the absence of light 
(Mustilli and Bowler, 1997) . 
5 Double Mutant Analyses 

Double mutant analysis with deti and phytochrome- 
deficient mutants in Arabidopsis indicate that the deti 
mutation is completely epistatic to photoreceptor mutations 
(Chory, 1992) . This has been interpreted as meaning that 

10 DETI acts downstream of phytochrome. To determine the 
relationship between TDETl and phytochrome function, we 
examined the effects of the hp-2 mutation in the aurea 
mutant background, a phytochrome chromophore-def icient 
mutant (Terry and Kendrick, 1996) . In contrast to its 

15 counterpart double mutant in Arabidopsis, au hp-'2 is 
similar to the single mutant au. For example, hypocotyls 
are elongated and anthocyanin accumulation is limited 
(Figure 7) . Although in genetic terms this result therefore 
infers that au is epistatic to hp-2r the authors 

20 nonetheless consider it more likely that TDETl acts 
downstream of phytochrome, as proposed for Arabidopsis , but 
that its activity as a negative regulator is strictly 
dependent upon the presence of active phytochrome. This 
requirement is clearly not observed in Arabidopsis (Chory, 

25 1992) . The small but significant reduction in hypocotyl 
length and the small increase in anthocyanin found in the 
au hp-2 double mutant compared with au (Figure 7) is likely 
therefore to be a result of hypersensitivity caused by the 
hp-2 mutation towards the low amounts of functional 

30 phytochrome present in the au mutant, which is estimated 
to be about 3% wild-type levels (Adams et al . , 1989; Parks 
et al, 1987) . Therefore the hp-2 phenotype in tomato is 
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strictly dependent on the presence of phytochrome, whereas 
in Arabidopsis the detl mutation is independent of the 
presence or absence of functional photoreceptors* 

The absence of de-etiolated mutants in tomato and the 
5 strict light- and phytochrome- dependence of the hp mutant 
phenotype may suggest that in tomato TDETl function is 
either redundant or that the signal transduction pathways 
regulating its activity operate in different ways in the 
two plants (Mustilli and Bowler, 1997) . Because by Southern 

10 analysis the authors have shown that in tomato^ like in 
Arabidopsis (Pepper et al., 1994), only one DETl homologous 
gene is present (data not shown) , the latter hypothesis 
seems to be more likely. 

Based upon the author's findings, it is likely that 

15 only a few key regulators will be found to be responsible 
for controlling light responses in all higher plants. 
Therefore it would be very interesting to clone other 
tomato genes encoding proteins homologous to COP/DET/FUS 
and examine whether modification of their activity also 

20 results in light hypersensitive phenotypes . 

Binary plasmids for modulation of TDETl activity 

In order to generate tomato plants with an hp-2 
phenotype specifically in the fruit, the authors 
constructed plasmids which are able to produce the TDETl 

25 antisense RNA sequence, or portions thereof, only in the 
fruit by using the regulatory sequences of the tomato E8 
gene (Figure 9) . 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

5 (i) APPLICANT: 

(A) NAME: Stazione Zoologica "Anton Dohrn" 

(B) STREET: Villa Comunale 

(C) CITY: Naples 
(E) COUNTRY: Italy 

10 (F) POSTAL CODE (ZIP) : 80121 

(ii) TITLE OF INVENTION: Nucleotide sequences encoding the tomato 
light hypersensitive phenotype, encoded proteins and uses thereof 

15 (iii) NUMBER OF SEQUENCES: 12 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

20 (C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.30 (EPO) 

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 2000 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
30 (ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 149. . 1720 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
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10 



CTTCCCTCTT AGACTTTATC GATCCTAATT CGAGCCCTCC TTTTTTCAAT CAATTATCAA 60 

TTTAGTCCTA CTGCGATTTT GATATGTATG ATTCACAATT TTAATGCTGC TGAAAGCAAT 120 

TATATAAAAG CTGAAACATT TTGCACTG ATG TTC AAA ACT AAC AAT GTT ACC 172 

Met Phe Lys Thr Asn Asn Val Thr 

1 5 

GCC AGG CTT TTT GAG CGC CAG ATT TGC ACC CCT GCT CCT GGC ACC AGC 220 
Ala Arg Leu Phe Glu Arg Gin lie Cys Thr Pro Ala Pro Gly Thr Ser 
10 15 20 

ATC CAT CGT GCC AGA AGA TTT TAT GAG AAC GTT GTA CCA AGT TAT ACC 268 
lie His Arg Ala Arg Arg Phe Tyr Glu Asn Val Val Pro Ser Tyr Thr 
25 30 35 40 

15 ATA TAG GAT GTT GAA TGT CCC GAG GAT TGA TTT CGC AAG TTC ACG GAT 316 

lie Tyr Asp Val Glu Cys Pro Asp His Ser Phe Arg Lys Phe Thr Asp 
45 50 55 

GAC GGT CTA TAT TTT GTA AGT TTC AGC CGA AAC CAT CAG GAT CTG GTT 364 
20 Asp Gly Leu Tyr Phe Val Ser Phe Ser Arg Asn His Gin Asp Leu Val 

60 65 70 

GTT TAT AGA CCA ACA TGG CTG ACA TTT TCC TGC AAA GAA GAA GAT TGT 412 
Val Tyr Arg Pro Thr Trp Leu Thr Phe Ser Cys Lys Glu Glu Asp Cys 
25 75 80 85 

GAT ACT CAT GAT CTT CCT TTG AAA GCT AGA AAG TTT GAG AGC TTC TTC 460 
Asp Thr His Asp Leu Pro Leu Lys Ala Arg Lys Phe Glu Ser Phe Phe 
90 95 100 



30 



ACA CAG TTG TAG AGT GTT ACT CTT GGT TGT AGT GGG GAA GTT ATA TGG 50 8 

Thr Gin Leu Tyr Ser Val Thr Leu Ala Ser Ser Gly Glu Leu lie Cys 
105 110 115 120 
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AAA GAT TTC TTT CTC TAT ATG GAG AGC AAC CAA TTT GGA CTC TTT GCA 556 
Lys Asp Phe Phe Leu Tyr Met Glu Ser Asn Gin Phe Gly Leu Phe Ala 
125 130 135 

5 ACT TCA ACT GCA CAA ATT CAT GAT GCA CCT CCT ACT GGA GGG GCA ATT 604 

Thr Ser Thr Ala Gin lie His Asp Ala Pro Pro Thr Gly Gly Ala lie 
140 145 150 

CAG GGA GTC CCT TCA GTT GAA AAA ATA ACT TTC CAC CTT TTG AGG TTG 652 
10 Gin Gly Val Pro Ser Val Glu Lys lie Thr Phe His Leu Leu Arg Leu 

155 160 165 

GTG GAT GGA GCT ATA CTT GAC GAA AGG GTT TTC CAC AAT GAT TAT GTT 700 
Val Asp Gly Ala lie Leu Asp Glu Arg Val Phe His Asn Asp Tyr Val 
15 170 175 180 

AAT TTG GCA CAT AGC ATT GGT GCT TTC TTG TAT GAT GAT TTG CTT GCT 74 8 

Asn Leu Ala His Ser lie Gly Ala Phe Leu Tyr Asp Asp Leu Leu Ala 
185 190 195 200 

20 

ATA GTG TCT CTT CGT TAT CAA AGA ATA CAC ATC CTT CAG ATC AGA GAT 796 

lie Val Ser Leu Arg Tyr Gin Arg lie His lie Leu Gin lie Arg Asp 
205 210 215 

25 TCT GGA GAT CTT GTT GAT GTA CGA GCA ATT GGG GAA TTC TGC CGT GAA 844 

Ser Gly Asp Leu Val Asp Val Arg Ala lie Gly Glu Phe Cys Arg Glu 
220 225 230 

GAT GAT GAA CTT TTT CTC AAT TCC AAT TCC CAG GTG CTT GTA AAT CAT 892 
30 Asp Asp Glu Leu Phe Leu Asn Ser Asn Ser Gin Val Leu Val Asn His 

235 240 245 

GTT GGA AAT GGT TTT CAT CAT AGT CTG CCT CAA TCA GAA ACT TCT TTC 940 
Val Gly Asn Gly Phe His His Ser Leu Pro Gin Ser Glu Thr Ser Phe 
35 250 255 260 
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CTG AGC GGT ATA AAG CAA CGG CTG CTT 
Leu Ser Gly lie Lys Gin Arg Leu Leu 
265 270 

5 TGG AAT GAA GCT GAC CAA ACC ATG AGA 

Trp Asn Glu Ala Asp Gin Thr Met Arg 
285 

TTT TAG TTC CAC TTT CAA GAT TAG ATT 
10 Phe Tyr Phe His Phe Gin Asp Tyr lie 

300 305 



TCA TAT ATA TTT CGA GGT ATA 98 8 
Ser Tyr lie Phe Arg Gly lie 
275 280 

GTG CAG TGC CTG AAG AAG AAG 1036 
Val Gin Cys Leu Lys Lys Lys 
290 295 

GAC TTG ATT ATC TGG AAG GTG 1084 
Asp Leu lie lie Trp Lys Val 
310 



CAG TTT TTG GAC CGA CAT 
Gin Phe Leu Asp Arg His 
15 315 

GGT GGG GTA TCC CGA AAT 
Gly Gly Val Ser Arg Asn 
330 

20 

TAC AAT ATG GAG ACT ACT 
Tyr Asn Met Glu Thr Thr 
345 350 



CAC CTG TTG ATC AAG TTT 
His Leu Leu lie Lys Phe 
320 

GCT GAC ATC CAT CCT TCT 
Ala Asp lie His Pro Ser 
335 340 

GAA ATT GTT GCA TTT TAT 
Glu lie Val Ala Phe Tyr 
355 



GGC AGT GTT GAT 1132 

Gly Ser Val Asp 

325 

TTT TTT GCT GTT 1180 
Phe Phe Ala Val 

CAG AAC TCA GCC 1228 
Gin Asn Ser Ala 
360 



25 GAT GAG CTT TAT TTC TTG TTC GAG CTG TTC AGC GAC CAT TTT CAC GTT 1276 

Asp Glu Leu Tyr Phe Leu Phe Glu Leu Phe Ser Asp His Phe His Val 
365 370 375 

TCA TCC AAA AGT TCA TTA CAT ATG AAC TTC ATG TCC TCA CAC TCA AAC 1324 
30 Ser Ser Lys Ser Ser Leu His Met Asn Phe Met Ser Ser His Ser Asn 

380 385 390 



35 



AAC ATC CAC GCC CTC GAG CAA CTA AGG TGT ACA AAG AAC AAA GCA ACC 
Asn lie His Ala Leu Glu Gin Leu Arg Cys Thr Lys Asn Lys Ala Thr 
395 400 405 



1372 
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AAT TTC TCT CAA TTT GTT AAG AAA ATG ATG GCT TCC TTG CCT TGT AGT 1420 
Asn Phe Ser Gin Phe Val Lys Lys Met Met Ala Ser Leu Pro Cys Ser 
410 415 420 

TGT CAG TCT CAG AGT CCT TCC CCA TAT TTT GAC CAA TCT CTC TTC AGG 14 68 
Cys Gin Ser Gin Ser Pro Ser Pro Tyr Phe Asp Gin Ser Leu Phe Arg 
425 430 435 440 

TTT GAC GAG AAG CTT ATT TCA GCT ATT GAC CGC CAT AGA CAG TCT ACT 1516 
Phe Asp Glu Lys Leu lie Ser Ala lie Asp Arg His Arg Gin Ser Thr 
445 450 455 

GAC CAT CCA ATC AAA TTC ATT TCT AGA AGA CAA CCC AAT ATC CTG AAA 1564 
Asp His Pro lie Lys Phe lie Ser Arg Arg Gin Pro Asn lie Leu Lys 
460 465 470 

TTC AAA ATG AAG CCA GGA CCT GAA GCT GGC AGC ACA GAT GGG CGA ACT 1612 
Phe Lys Met Lys Pro Gly Pro Glu Ala Gly Ser Thr Asp Gly Arg Thr 
475 480 485 

AAG AAG ATC TGT TCC TTC CTC TTC CAC CCA ATA TTA CCC CTT GCA CTT 1660 
Lys Lys lie Cys Ser Phe Leu Phe His Pro lie Leu Pro Leu Ala Leu 
490 495 500 

TCT GTT CAA CAA ACC TTG TTT CTG CAG GCA TCA GTT GTA AAT ATC CAT 1708 
Ser Val Gin Gin Thr Leu Phe Leu Gin Ala Ser Val Val Asn lie His 
505 510 515 520 

TTT CGT CGA TAA TGTAAAAACT TAATTTATAT GTTACCGATT TGTTTATAAA 1760 
Phe Arg Arg * 

TTTCTCTAAT AACCTCTAGA TTGAAATCAA CCTAGAAATC ACAAATTCAT CATAACAGAC 182 0 
CCGTAGATGC TAGTGTCTTT GACTTCTACA TTTTCTTTGT TACAAGAATC AAACAAATGC 18 8 0 
TTGATAGAGT GCCAAGACGG TTAGTATGGG TATAAGGATT AGTTCTTCTG TAAGTTTTTG 194 0 
TTACAGCTTC TCTTCTAATT AATTGATGTA CATTCAGATG TTAAAAAAAA AAAAAAAAAA 2000 
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(2) INFORMATION FOR SEQ ID NO : 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 524 amino acids 

(B) TYPE: amino acid 
5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
Met Phe Lys Thr Asn Asn Val Thr Ala Arg Leu Phe Glu Arg Gin lie 
15 10 15 

10 Cys Thr Pro Ala Pro Gly Thr Ser lie His Arg Ala Arg Arg Phe Tyr 

20 25 30 

Glu Asn Val Val Pro Ser Tyr Thr lie Tyr Asp Val Glu Cys Pro Asp 

35 40 45 

His Ser Phe Arg Lys Phe Thr Asp Asp Gly Leu Tyr Phe Val Ser Phe 
15 50 55 60 

Ser Arg Asn His Gin Asp Leu Val Val Tyr Arg Pro Thr Trp Leu Thr 
65 70 75 80 

Phe Ser Cys Lys Glu Glu Asp Cys Asp Thr His Asp Leu Pro Leu Lys 
85 90 95 

20 Ala Arg Lys Phe Glu Ser Phe Phe Thr Gin Leu Tyr Ser Val Thr Leu 

100 105 110 

Ala Ser Ser Gly Glu Leu lie Cys Lys Asp Phe Phe Leu Tyr Met Glu 

115 120 125 

Ser Asn Gin Phe Gly Leu Phe Ala Thr Ser Thr Ala Gin lie His Asp 
25 130 135 140 

Ala Pro Pro Thr Gly Gly Ala lie Gin Gly Val Pro Ser Val Glu Lys 
145 150 155 160 

lie Thr Phe His Leu Leu Arg Leu Val Asp Gly Ala lie Leu Asp Glu 
165 170 175 

30 Arg Val Phe His Asn Asp Tyr Val Asn Leu Ala His Ser lie Gly Ala 

180 185 190 

Phe Leu Tyr Asp Asp Leu Leu Ala lie Val Ser Leu Arg Tyr Gin Arg 

195 200 205 

lie His lie Leu Gin lie Arg Asp Ser Gly Asp Leu Val Asp Val Arg 
35 210 215 220 
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Ala lie Gly Glu 
225 

Asn Ser Gin Val 

Leu Pro Gin Ser 
260 

Leu Ser Tyr lie 
275 

Arg Val Gin Cys 
290 

lie Asp Leu lie 
305 

Leu lie Lys Phe 

lie His Pro Ser 

340 

Val Ala Phe Tyr 
355 

Leu Phe Ser Asp 
370 

Asn Phe Met Ser 
385 

Arg Cys Thr Lys 

Met Met Ala Ser 
420 

Tyr Phe Asp Gin 
435 

lie Asp Arg His 
450 

Arg Arg Gin Pro 
465 

Ala Gly Ser Thr 

His Pro lie Leu 
500 



Phe Cys Arg Glu 
230 

Leu Val Asn His 
245 

Glu Thr Ser Phe 

Phe Arg Gly lie 
280 

Leu Lys Lys Lys 
295 

lie Trp Lys Val 
310 

Gly Ser Val Asp 

325 

Phe Phe Ala Val 

Gin Asn Ser Ala 
360 

His Phe His Val 
375 

Ser His Ser Asn 
390 

Asn Lys Ala Thr 
405 

Leu Pro Cys Ser 

Ser Leu Phe Arg 
440 

Arg Gin Ser Thr 
455 

Asn lie Leu Lys 

470 

Asp Gly Arg Thr 
485 

Pro Leu Ala Leu 



Asp Asp Glu Leu 
235 

Val Gly Asn Gly 
250 

Leu Ser Gly lie 
265 

Trp Asn Glu Ala 

Phe Tyr Phe His 
300 

Gin Phe Leu Asp 

315 

Gly Gly Val Ser 

330 

Tyr Asn Met Glu 
345 

Asp Glu Leu Tyr 

Ser Ser Lys Ser 
380 

Asn lie His Ala 
395 

Asn Phe Ser Gin 
410 

Cys Gin Ser Gin 
425 

Phe Asp Glu Lys 

Asp His Pro lie 

460 

Phe Lys Met Lys 
475 

Lys Lys lie Cys 
490 

Ser Val Gin Gin 
505 



Phe Leu Asn Ser 
240 

Phe His His Ser 
255 

Lys Gin Arg Leu 
270 

Asp Gin Thr Met 
285 

Phe Gin Asp Tyr 

Arg His His Leu 

320 

Arg Asn Ala Asp 
335 

Thr Thr Glu lie 
350 

Phe Leu Phe Glu 
365 

Ser Leu His Met 

Leu Glu Gin Leu 
400 

Phe Val Lys Lys 
415 

Ser Pro Ser Pro 
430 

Leu lie Ser Ala 
445 

Lys Phe lie Ser 

Pro Gly Pro Glu 
480 

Ser Phe Leu Phe 
495 

Thr Leu Phe Leu 
510 



wo 99/29866 



36 



PCT/IT98/00350 



Gin Ala Ser Val Val Asn lie His Phe Arg Arg * 
515 520 

(2) INFORMATION FOR SEQ ID NO: 3: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
CATCAACACT GCCAAAC 

(2) INFORMATION FOR SEQ ID NO: 4: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
GAAAGCAGCC GTTGCT 

(2) INFORMATION FOR SEQ ID NO: 5: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
AGTTCATCAT CTTCACGGC 

(2) INFORMATION FOR SEQ ID NO: 6: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
GTATGATTCA CTAGTTTAAT GCTGCTGAAA G "31 

(2) INFORMATION FOR SEQ ID NO; 7: 
5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

{ C ) STRANDEDNESS : single 
(D) TOPOLOGY: linear 
10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

CCCATACTAG TCGTCTTGGC ACTCTATCAA G 31 

(2) INFORMATION FOR SEQ ID NO: 8: 
(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
20 GAAGGTAATT TTATATTAAA CATAGAATAG A 31 

(2) INFORMATION FOR SEQ ID NO: 9: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 8 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
GTGATTTCTA GGTTGATTTC AATCTAGA 28 

30 

(2) INFORMATION FOR SEQ ID NO: 10: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 
35 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CAAATCGGTA ACATAT 16 

(2) INFORMATION FOR SEQ ID NO: 11: 
5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

GGGGAAGCTT TTTCACGAAA TCGGCCCTTA 30 

(2) INFORMATION FOR SEQ ID NO: 12: 
(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
2 0 CCCGGATCCT TCTTTTGCAC TGTGAATGAT TAG 33 
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CLAIMS 

1. Nucleic acid comprising a nucleotide sequence 
encoding a protein, or functional portions thereof, which, 
if altered, is responsible for the light hypersensitive 

5 mutant phenotype in Solarium lycoperslcum plants, said 
phenotype comprising a reduced growth of the plant 
associated with high levels of carotenoids and/or 
cholophylls and/or flavonoids, 

2. Nucleic acid according to claim 1 encoding a tomato 
10 protein homologous to at least one of the Arahidopsis 

COP/DET/FUS family proteins, provided that it is 
responsible for the light hypersensitive mutant phenotype. 

3. Nucleic acid according to claim 2 comprising the 
nucleotide sequence encoding the TDETl {HP-2) protein, or 

15 functional portions thereof, 

4. Nucleic acid according to claim 3 comprising a 
nucleotide sequence encoding the protein having the amino 
acid sequence of SEQ ID No. 2, or functional portions 
thereof. 

20 5. Nucleic acid according to claim 4 comprising a 

nucleotide sequence comprised in SEQ ID No. 1. 

6. Nucleic acid according to claim 5 wherein the 
nucleotide sequence comprised in SEQ ID No. 1 is the 
sequence from nt. 149 to nt . 1720, or portions thereof. 
25 7. Nucleic acid comprising a nucleotide sequence 

complementary to SEQ ID No. 1 or portions thereof. 

8. Nucleic acid according to claim 3 or 4 comprising a 
mutation which is able to induce the light hypersensitive 
phenotype . 

30 9. Nucleic acid according to claim 8 wherein the SEQ 

ID No. 1 comprises at least a T as a substitute for C at 
position 1640. 
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10. Nucleic acid according to claim 8 wherein the SEQ 
ID No. 1 comprises at least a nucleotide deletion from nt. 
1581 to nt. 1589. 

11. Expression vector comprising the nucleic acid 
5 according to any one of preceding claims under the control 

of a promoter that is active in plants. 

12. Vector according to claim 11 wherein said promoter 
is selectively active only in a few plant organs. 

13. Vector according to claim 12 wherein said promoter 
10 is active in the fruits. 

14. Vector according to any one of preceding claims 
11-13 which is able to control the transcription of an 
antisense RNA. 

15. Vector according to claim 14 wherein said vector 
15 is the PBIN-E8-HP2-AS1 vector or the pBIN-E8-HP2-ASl 

vector. 

16. Use of vector according to any one of preceding 
claims 11-15 to produce transgenic plants containing the 
nucleic acid according to any one of preceding claims 1-10, 

20 under the control of specific regulatory sequences, 
preferably in selected organs of plants. 

17. Use of vector according to claim 16 wherein said 
transgenic plant belongs to pepper, eggplant, soybean, 
grape, melon, rice, carrot, spinach, citrus, pomaceae or 

25 ornamental species. 

18. Transgenic plant obtainable by the use of 
transforming vector according to any one of the preceding 
claims 11-15. 

19. Transgenic plant according to claim 18 belonging 
30 to pepper, eggplant, soybean, grape, melon, rice, carrot, 

spinach, citrus, pomaceae and ornamental species. 
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20. Protein, or portions thereof, which, if altered, 
is responsible for the light hypersensitive phenotype in 
Solanum lycoperslcum plants, 

21. A protein according to claim 20 homologous with at 
5 least one of the Arabidopsis COP/DET/FUS family proteins, 

whose alteration is responsible for a light hypersensitive 
phenotype . 

22. Protein according to claim 21 comprising the amino 
acid sequence of SEQ ID No . 2, or portions thereof. 

10 23. Protein according to claim 22 comprising an 

alteration to the amino acid sequence of SEQ ID No. 2, or 
portions thereof, which are able to induce a light 
hypersensitive phenotype. 

24. Protein according to claim 23 comprising at least 
15 an alteration in the SEQ ID No. 1 C- terminal portion. 

25. Protein according to claim 24 wherein said 
alteration in the C-terminal portion comprises a 
substitution of proline at 498 position. 

26. Protein according to claim 25 wherein said 
20 substitution comprises a serine as a substitute for 

proline . 

27. Protein according to claim 24 wherein said 
alteration in the C-terminal portion comprises a deletion 
of at least one amino acid in the second NLS domain. 

25 28. Protein according to claim 27 wherein said 

deletion comprises at least three amino acids. 

29. Protein according to claim 28 wherein said 
deletion comprising at least three amino acids is from aa. 
478 to aa. 480. 
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10 






1 


cttccctctt 


agaccttatc 


gatcctaatt 


cgagccctcc 


ttttttcaat 


51 


caattatcaa 


tttagtccta 


ctgcgatttt 


gatatgtatg 


attcacaatt 


101 


ttaatgctgc 


tgaaagcaat 


tatataaaag 


ctgaaacatt 


ttgcactgAT 


151 


GTTCAAAACT 


AACAATGTTA 


CCGCCAGGCT 


TTTTGAGCGC 


CAGATTTGCA 


201 


CCCCTGCTCC 


TGGCACCAGC 


ATCCATCGTG 


CCAGAAGATT 


TTATGAGAAC 


251 


GTTGTACCAA 


GTTATACCAT 


ATACGATGTT 


GAATGTCCCG 


ACCATTCATT 


301 


TCGCAAGTTC 


ACGGATGACG 


GTCTATATTT 


TGTAAGTTTC 


AGCCGAAACC 


351 


ATCAGGATCT 


GGTTGTTTAT 


AGACCAACAT 


GGCTGACATT 


TTCCTGCAAA 


401 


GAAGAAGATT 


GTGATACTCA 


TGATCTTCCT 


TTGAAAGCTA 


GAAAGTTTGA 


451 


GAGCTTCTTC 


ACACAGTTGT 


ACAGTGTTAC 


TCTTGCTTCT 


AGTGGGGAAC 


501 


TTATATGCAA 


AGATTTCTTT 


CTCTATATGG 


AGAGCAACCA 


ATTTGGACTC 


551 


TTTGCAACTT 


CAACTGCACA 


AATTCATGAT 


GCACCTCCTA 


CTGGAGGGGC 


601 


AATTCAGGGA 


GTCCCTTCAG 


TTGAAAAAAT 


AACTTTCCAC 


CTTTTGAGGT 


651 


TGGTGGATGG 


AGCTATACTT 


GACGAAAGGG 


TTTTCCACAA 


TGATTATGTT 


701 


AATTTGGCAC 


ATAGCATTGG 


TGCTTTCTTG 


TATGATGATT 


TGCTTGCTAT 


751 


AGTGTCTCTT 


CGTTATCAAA 


GAATACACAT 


CCTTCAGATC 


AGAGATTCTG 


801 


GAGATCTTGT 


TGATGTACGA 


GCAATTGGGG 


AATTCTGCCG 


TGAAGATGAT 


851 


GAACTTTTTC 


TCAATTCCAA 


TTCCCAGGTG 


CTTGTAAATC 


ATGTTGGAAA 


901 


TGGTTTTCAT 


CATAGTCTGC 


CTCAATCAGA 


AACTTCTTTC 


CTGAGCGGTA 


951 


TAAAGCAACG 


GCTGCTTTCA 


TATATATTTC 


GAGGTATATG 


GAATGAAGCT 


1001 


GACCAAACCA 


TGAGAGTGCA 


GTGCCTGAAG 


AAGAAGTTTT 


ACTTCCACTT 


1051 


TCAAGATTAC 


ATTGACTTGA 


TTATCTGGAA 


GGTGCAGTTT 


TTGGACCGAC 


1101 


ATCACCTGTT 


GATCAAGTTT 


GGCAGTGTTG 


ATGGTGGGGT 


ATCCCGAAAT 


1151 


GCTGACATCC 


ATCCTTCTTT 


TTTTGCTGTT 


TACAATATGG 


AGACTACTGA 


1201 


AATTGTTGCA 


TTTTATCAGA 


ACTCAGCCGA 


TGAGCTTTAT 


TTCTTGTTCG 


1251 


AGCTGTTCAG 


CGACCATTTT 


CACGTTTCAT 


CCAAAAGTTC 


ATTACATATG 


1301 


AACTTCATGT 


CCTCACACTC 


AAACAACATC 


CACGCCCTCG 


AGCAACTAAG 


1351 


GTGTACAAAG 


AACAAAGCAA 


CCAATTTCTC 


TCAATTTGTT 


AAGAAAATGA 


1401 


TGGCTTCCTT 


GCCTTGTAGT 


TGTCAGTCTC 


AGAGTCCTTC 


CCCATATTTT 


1451 


GACCAATCTC 


TCTTCAGGTT 


TGACGAGAAG 


CTTATTTCAG 


CTATTGACCG 


1501 


CCATAGACAG 


TCTACTGACC 


ATCCAATCAA 


ATTCATTTCT 


AGAAGACAAC 


1551 


CCAATATCCT 


GAAATTCAAA 


ATGAAGCCAG 


GACCTGAAGC 


TGGCAGCACA 


1601 


GATGGGCGAA 


CTAAGAAGAT 


CTGTTCCTTC 


CTCTTCCACC 


CAATATTACC 


1651 


CCTTGCACTT 


TCTGTTCAAC 


AAACCTTGTT 


TCTGCAGGCA 


TCAGTTGTAA 


1701 


ATATCCATTT 


TCGTCGATAA 


tgtaaaaact 


taatttatat 


gttaccgatt 


1751 


tgtttataaa 


tttctctaat 


aacctctaga 


ttgaaatcaa 


cctagaaatc 



Fig. 2 
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1801 acaaattcat cataacagac ccgtagatgc tagtgtcttt gacttctaca 

1851 ttttctttgt tacaagaatc aaacaaatgc ttgatagagt gccaagacgg 

1901 ttagtatggg tataaggatt agttcttctg taagtttttg ttacagcttc 

1951 tcttctaatt aattgatgta cattcagatg tttaaaaaaa aaaaaaaaaa 



Fig. 2 
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1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 



MFKTNNVTAR LFERQICTPA PGTSIHRARR FYENWPSYT lYDVECPDHS 
FRFPTDDGLY FVSFSRlsIHQD LWYRPTWLT FSCKEEDCDT HDLPLKARKF 
ESFFTQLYSV TLASSGELIC KDFFLYMESN QFGLFATSTA QIHDAPPTGG 
AIQGVPSVEK ITFHLLRLVD GAILDERVFH NDYVNLAHSI GAFLYDDLLA 
IVSLRYQRIH ILQIRDSGDL VDVRAIGEFC KEDDELFUSTS NSQVLVNHVG 
NGFHHSLPQS ETSFLSGIKQ RLLSYIFRGI WISEADQTMRV QCLKKKFYFH 
FQDYIDLIIW KVQFLDRHHL LIKFGSVDGG VSRNADIHPS FFAVYNMETT 
EIVAFYQNSA DELYFLFELF SDHFHVSSKS SUiMNFMSSH SNNIHALEQL 
RCTKNKATNF SQFVKKMMAS LPCSCQSQSP SPYFDQSLFR FDEKLISAID 
RHRQSTDHPI KFISRRQPNI IxPCFKMKPGPE AGSTDGRTKK ICSFIiFHPIL 
PLAIiSVQQTL FLQASWNIH FRR 
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